Re-examining Google Tri-grams Measure (GTM) Sentence Similarity

نویسندگان

چکیده

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Text Similarity Using Google Tri-grams

The purpose of this paper is to propose an unsupervised approach for measuring the similarity of texts that can compete with supervised approaches. Finding the inherent properties of similarity between texts using a corpus in the form of a word n-gram data set is competitive with other text similarity techniques in terms of performance and practicality. Experimental results on a standard data s...

متن کامل

Using Sentence Similarity Measure for Plagiarism Source Retrieval

This paper describes a method that was implemented in the software submitted to PAN 2014 competition for the source retrieval task. For generating queries we use the most important noun phrases and words of sentences selected from a given suspicious document. To download documents that are likely to be sources of plagiarism we employ a sentence similarity measure.

متن کامل

Sentence Extraction by Spreading Activation with Refined Similarity Measure

Although there has been a great deal of research on automatic summarization, most methods are based on a statistical approach, disregarding relationships between extracted textual segments. To ensure sentence connectivity, we propose a novel method to extract a set of comprehensible sentences that centers on several key points. This method generates a similarity network from documents with a le...

متن کامل

A Link Grammar and Semantic Corpus Based Sentence Similarity Measure

A novel sentence similarity measure that based on grammar and semantic corpus is presented. The well-known problem in the field of semantic processing, such as natural language processing, QA systems, expert systems, search engines, etc., is trying to evaluate the semantic similarity between sentences or articles. The major challenge is to evaluate the similarity of sentence-vs.-sentence since ...

متن کامل

Sentence Similarity by Combining Explicit Semantic Analysis and Overlapping N-Grams

We propose a similarity measure between sentences which combines a knowledge-based measure, that is a lighter version of ESA (Explicit Semantic Analysis), and a distributional measure, Rouge.We used this hybrid measure with two French domain-orientated corpora collected from the Web and we compared its similarity scores to those of human judges. In both domains, ESA and Rouge perform better whe...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: International Journal of Knowledge Engineering

سال: 2017

ISSN: 2382-6185

DOI: 10.18178/ijke.2017.3.2.091